Europaisches 
Patentamt 



European 
Patent Office 



PCT/NL 2004 /QQ(K8fr 



Bescheinigung Certificate Attestation 



Die angehefteten Unterla- 
gen stimmen mit der 
ursprQnglich eingereichten 
Fassung der auf dem nach- 
sten Blatt bezeichneten 
europaischen Patentanmel- 
dung Qberein. 



The attached documents 
are exact copies of the 
European patent application 
described on the following 
page, as originally filed. 



Les documents fixes a 
cette attestation sont 
conformes a la version 
initialement deposee de 
la demande de brevet 
europ6en specifiee a la 
page suivante. 



Patentanmeldung Nr. Patent application No- Demande de brevet n° 

03078932.5 



Der Prasident des Europaischen Patentamts; 
lm Auftrag 

For the President of the European Patent Office 

Le President de I'Office europeen des brevets 
p.o. 



R C van Dijk 




Europaisches 
Patentamt 



European 
Patent Office 



Office europeen 
des brevets 



Anmeldung Nr: 

Application no.: 03078932.5 



Demande no: 



Anmeldetag: 
Date of filing: 
Date de depdt: 



18.12.03 



Anmel der/Appl 1 can t( s)/Demandeur( s ) : 

Universiteit Utrecht Holding B.V. 
Jenalaan 18a 
3584 CK Utrecht 
PAYS -B AS 



Bezel chnung der Erf 1ndung/T1 tie of the 1nvent1on/Tltre de V Invention: 
(Falls die Bezelchnung der Erflndung nlcht angegeben 1st, slehe Beschrelbung. 
If no title Is shown please refer to the description. 
S1 aucun tltre n'est 1nd1qu<§ se referer a la description.) 

Newly identified cholinephosphotransf erases and ethanolaminephosphotransf erases 

In Anspruch genommene PrloHat(en) / Priori ty(1es) claimed /Pr1or1t§(s) 
revend1qu6e(s) 

Staat/Tag/Aktenze1chen/State/Date/Flle no. /Pays/Date/Nume>o de dep6t: 



Internationale Patentkl ass1 f 1 katlon/Internatl onal Patent Classification/ 
Classification Internationale des brevets: 

C12N9/00 

Am Anmeldetag benannte Vertragstaaten/Contractl ng states designated at date c 
flHng/Etats contractants designees lors du depot: 

AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IT LU MC NL 
PT RO SE SI SK TR LI 



03078932.5 

EPA/EP0/0EB Form 1014.2 - 01.2000 7001014 



2 



18/12 2003 17:31 FAX -> EPO-NL 111009 

009 18.12.2003 17:29:49 



LA 03.1240 EPP 



Newly identified cholinephosphotransferases and ethanolaminephosphotransferases 

5 The present invention relates to polypeptides which transfer phosphocholine and 

phosphoethanolamine and the newly identification thereof, nucleotide sequences coding 
for any of these polypeptides and nucleotide sequences complementary thereto, 
plasmids, vectors and a (micro)organism or cell comprising said nucleotide sequences. 
Furthermore, the present invention relates to processes to produce 

10 cholinephosphotransferases and ethanolaminephosphotransferases like sphingomyelin 
synthase, ethanolamine phosphorylceramide synthase, phosphatidylcholinetglycoprotein 
cholinephosphotransferase and phosphatidylcholinerglycolipid 

cholinephosphotransferases The present invention also provides the application of said 
nucleotide sequences to influence the equilibrium reactions or to develop compounds 
15 influencing the equilibrium reactions wherein said transferases are involved and the 
application of said compounds in medical use. Finally, a process has been provided to 
isolate candidates for functional genes of a previously unidentified 
enzyme from a huge database. 

Cholinephospho (Ch-P) transferases and ethanolaminephosphotransferases 
20 catalyze phosphosphingolipid synthesis in animals and parasites. Sphingomyelin (SM) is 
an abundant constituent of cellular membranes in a wide range of organisms, from 
mammals (Ullman and Radin, 1974) and nematodes (Satouchi et aL, 1993) to protozoa 
like the human malaria parasite, Plasmodium falciparum (Elmendorf and Haldar 1994). 
SM is preferentially concentrated in the outer leaflet of the plasma membrane. Its high 
25 packing density and affinity for sterols help provide a rigid barrier to the extracellular 
environment and play a role in the formation of lipid rafts, specialised areas in cellular 
membranes with important functions in signal transduction and membrane trafficking 
(Simons and Toomre, 2000; Holthuis et aL, 2001). Since the discovery of the *SM 
cycle* as a putative signalling system analogous to well-known second messenger 
30 systems like the phosphoinositide pathway, SM has emerged to the focus of interest in 
many research laboratories (Kolesnick and Hannun, 1999; Andrieu-Abadie and Levade, 
2002). 
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Since the enzyme SM synthase is also able to catalyse the reverse reaction, 
namely the formation of PC from SM and DAG (Marggraf and Kanfer, 1984, van' 
Helvoort et al., 1994) it may regulate, in opposite directions, the cellular levels of the 
bioactive lipids ceramide and DAG. The latter two are important regulators of 
membrane trafficking, cell proliferation and apoptosis (Kolesnic and Hannun, 1999; 
Bankaitis, 2002; Brose and Rosenmund, 2002, Pettus et al., 2002). Hence, one may' 
anticipate that the physiological significance of SM synthase goes beyond the formation 
ofSM. 

The subcellular localisation of the enzyme has been the subject of numerous 
studies. After the initial debate had focused on whether SM synthase is located in the 
Golgi or at the plasma membrane (Marggraf et al.; Voelker and Kennedy, 1982; Lipsky 
and Pagano, 1985), subsequent work revealed evidence for the presence of SM synthase 
activity in both membranes (Futerman et al., 1990; Jeckel et al., 1990; van Helvoort et 
al., 1994). Whether SM synthesis detected at these locations is due to the presence of 
more than one isoform of the enzyme has remained an open issue. 

Progress in understanding the biological roles of SM synthesis and its regulation 
is hampered by the fact that no successful purification of the responsible enzyme has 
been achieved. Recent work has identified a bacterial SM synthase released from 
Pseudomonas aeruginosa (Luberto et al., 2003). However, this activity is a soluble 
protein, hence in contrast to the animal enzyme that is tightly associated with the 
membrane (Ullman and Radin, 1974; Voelker and Kennedy, 1982). Other efforts 
focused on the isolation of SM synthase mutants by screening Chinese hamster ovary 
cells for resistance to a SM-directed cytolysin (Hanada et al., 1998). Instead of mutants 
with a primary defect in SM synthase, this method yielded mutants defective in serine 
palmitoyltransferase activity or blocked in ceramide transport to the site of SM synthesis 
(Fukasawaetal., 1999). 

In the light of the prior art it is therefore highly desirable to identify and purify 
SM synthases, the composing amino acid sequence and microorganisms producing the 
same based on a process comprising the expression of nucleotide sequences coding for 
30 the amino acid sequences. 
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-3- 
Definitions 

Throughout this application the following definitions will be used: 

- homology: sharing an ancestral molecule 

5 - identity: extent to which amino acid sequences are invariant, in other words unchanged 

- "conservative" or "non-conservative" mutation: replacement of one amino acid by 
another which does not change the enzyme function upon alignment (lining up 
sequences to achieve maximum levels of identity and conservation) 

- similarity: percentage of identical amino acids or amino acids with similar properties 
10 between two sequences. 



Abbreviations 



Throughout this application the following abbreviations will be used: 
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Human (Hu) 

Caenorhabditis elegans or C. elegans (Ce) 
Drosophila melanogaster (Dm) 
Saccharomyces cerevisiae (Sc) t 
Schizosaccharomyces pombe (Sp) 
Candida albicans (Ca) 
sphingomyelin (SM) 
sphingomyelinase (SMase) 
phosphatidyl choline (PC) 
diacyl glycerol (DAG) 
ceramide (CER) 
inositol phosphorylceramide (BPC) 
lipid phosphate phosphatase (LPP) 

N.(7-(/^nitrobenxo-2-oxa-13-diazole)aminocaproylsphingosine (NBD-CER) 
tf-ethylmaleimide (NEM) 

electrospray ionization tandem mass spectrometry (ESI-MS/MS) 



N-Acyl-D-sphingosine-l-phosphocholine 

1 ,2-Diacyl-sn-glycero-3-phosphocholine 
sn-1 ,2 -Diacyl glycerol 
N-Acyl-D-sphingosine 
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phytoceramide-based SMs (pfaytoSM) 
phosphorylcholine (Ch-P) 
cytidine-5' diphosphochoUne-choline (CDP-Ch) 
tubovesicular membranes (TVM) 
5 phosphatidylethanol amin e (PE) 
phosphatidylinosito] (PI) 
phosphatidylserine (PS) 
phosphatidic acid (PA) 
phosphatidylglycerol (PG) 
10 open reading frame (ORF) 
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CSS1, CSS2 and CSS3 (for candidate SM synthase family 1-3), 

-- with human CSS3al and CSS3a2 renamed as huSMSl and huSMSl and 
C. elegans CSS3al and C. elegans CSS3ct2 renamed as ceSMSl and ceSMS2 

- with human CSS3p renamed as huSMSr (SMS related), with mice CSS30 
renamed as moSMS 1, with C. elegans CSS30 as ceSMSl and Drosophila 
CSS3p\asdoSMSl 

- with C. elegans CSS3y renamed as ceSMSdr, (SMS distendy related). 

- Description of tbe figures with legends. 



Fig. 1. Selection and phylogenetic analysis of candidate SM synthases. (A) 
Animal entries in SwissProt/TrEMBLwere searched for the presence of a sequence 
motif shared by U»Ps and Aurlp proteins and then further selected on the basis of three 

25 additional criteria, as indicated. (B) Phylogenetic tree of human candidate SM synthases 
(CSS) and previously characterised members of the human LPP superfamily. (C) 
Phylogenetic tree of CSS proteins from human (fl«), mouse (Mm), Caenorhabditis 
elegans (Q?) and Drosophila melanogaster (Dm), and of Aurlp proteins from 
Saccharomyces cerevisiae (&), Schizosaccharomyces pombe {Sp) and Candida albicans 

30 (Ca). Asterisks denote CSS proteins expressed in S. cerevisiae and tested for SM 

synthase activity. SM synthases (SMS) are marked in red. SwissProt/TrEMBL accession 
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numbers of CSS proteins are: (1) Q9TYV2; (2) Q20696; (3) Q9VS60/Q9VS61; (4) 
Q96LT4; (5) Q9DA37; (6) Q86VZ5; (7) Q8VCQ6; (8) Q8NHU3; (9) Q9D4B1; (10) 
Q9XTV2; (11) Q20735; (12) Q965Q4; (13) Q9D606; (14) Q9NXE2; (15) Q8VCY8; 
(16) Q96GM1; (17) Q96MPO; (18) AAP57768; (19) Q9BQF9; (20) AAP57767; (21) 
5 Q22250; (22) Q9TXU1; (23) Q9VNT9; (24) Q9VNU1; (25) Q10022; (26) Q9D4F2; 
(27) Q8IY26; (28) Q91WB2; (29) Q96SS7; (30) Q8T8T9; (31) Q22461. Note that (4) 
Q96LT4 contains a partial protein sequence and that the complete ORF was deduced 
from a corresponding EST clone (see the Table). 

10 Fig. 2* A subset of CSS3 family members displays SM synthase activity upon 

expression in yeast (A) Immunoblots of cells expressing various CSS proteins were 
stained with antibodies recognising the V5 epitope-tagged carboxy termini of CSS 
proteins. Control denotes cells transformed with empty vector. (B) TLC separation of 
reaction products generated when NBD-ceramide (NBD-Cer) was incubated with lysates 

15 of control or CSS-expressing cells in the presence (+ ) or absence (-) of IPC synthase 
inhibitor, aureobasidin A (Aba). (C) Metabolic labelling of cells expressing human 
CSS3al/SMSl or CSS3a2/SMS2 with [ 14 C]-choline and NBD-ceramide. The lipids 
were extracted, separated by two-dimensional TLC and analysed for fluorescence and 
radioactivity. Note that SMS1- and SMS2-expressing cells, but not control cells, 

20 synthesized NBD-SM, and that this NBD-SM was labelled with [ 14 C]-choline. (D) 

Metabolic labelling of cells expressing human CSS3a2/SMS2 with [ u C]-choline. The 
lipids were extracted, deacylated by mild alkaline hydrolysis (+NaOH) or control 
incubated (-NaOH) and separated by two-dimensional TLC before autoradiography. 
Note that SMS2-expressing cells synthesized alkaline-resistant species of [ 14 C]-choline- 

25 labelled lipids (phytoSM) that were absent in control cells. 

Fig. 3. NEM inhibits yeast-associated SMase but not mammalian SM synthase. 
(A) TLC separation of reaction products generated when NBD-SM (1.8 |xM) was 
incubated with lysates of exponentially grown yeast cells (20 OD^ cells) in the presence 
30 (+) or absence (-) of A^ethylmaleirnide (NEM; 1 mM). Reactions were in 0.35 ml RBI 
buffer containing 1 Jig/ml of IPC synthase inhibitor (aureobasidin A) and incubated for 
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30 min at 3TC. (B) TLC separation of reaction products generated when NBD-ceramide 
(NBD-Cer; 1,8 mM) was incubated withlysates of HeLa cells (1/1: 2 x 10 6 cells; 1/4: 
0.5 x 10* cells) in the presence (+) or absence (-) of MEM (1 mM). Reactions were in 
0.35 ml PNS buffer, and incubated for 30 min at 37'C. lipid extraction and TLC 
analysis were as described in Materials and methods. 
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Fig. 4. Detergent extracts of human CSS3a2/SMS2-expressing yeast cells 
support SM formation from bovine brain ceramides. Lines A and B show the 
reconstituted ion chromatograms of m/z 731.6, corresponding to the m/z ratio of 
protonated 18:0 SM, during the separation of molecular species of choline-containing 
phospholipids after solid phase extraction of membrane extracts that had been incubated 
with bovine brain ceramides and egg PC. The elation time of an authentic standard of 
18:0 SM is indicated in the chromatogram by an arrow (42.3 minutes). Line A was 
derived from detergent extracts of human CSS3o2/SMS2-expressing cells and tine B 
from detergent extracts of control cells. Mass spectra recorded at the elution time of 
18:0 SM confirmed that formation of 18:0 SM occurred in CSS3a2/SMS2-containing 
extracts (top right panel), but not in control extracts (bottom right panel). 

Fig. 5. SM synthase activity in detergent extracts of human SMS1- or SMS2- 
expressing yeast cells is stimulated by externally added PC. Detergent extracts of yeast 
cells expressing human SMS1 or SMS2 were diluted 2- to 8-fold in extraction buffer, 
and then incubated with NBD-ceramide (1.8 mM) in the presence (+) or absence (-) of 
externally added PC (22 mM), as described in Materials and methods. Formation of 
NBD-SM was monitored by one-dimensional TLC. 

Fig. 6. Human SMS1 and SMS2 function as PCceramide choline 
phosphotransferases. (A) Detergent extracts of yeast cells expressing human SMS1, 
SMS2 or transformed with empty vector (control) were incubated with NBD-ceramide 
(1-8 uM) in the presence or absence of different potential head group donors (22 mM), as 
indicated. Formation of NBD-SM or NBD-IPC was monitored by one-dimensional TLC 
and quantified as described in Materials and methods. Note that addition of PI 
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stimulated formation of NBD-IPC in all three extracts (asterisks). PC, 
phosphatidylcholine; SM, sphingomyelin; Ch, choline; Ch-P, phosphorylcholine; CDP- 
Ch, cytidine-5' diphosphocholine; PE, phosphatidylethanolamine; PI, 
phosphatidylinositol; PS, phosphatidylserine; PA, phosphatidic acid; PG, 
phosphatidylglyceroL (B) Detergent extracts of SMS2-expressing and control cells were 
incubated with NBD-ceramide and [ 3 H]-choline labelled PC. The lipids were extracted, 
separated by two-dimensional TLC and analysed for fluorescence and radioactivity. 
Note that only SMS2-expressing cells synthesized NBD-SM, and that this NBD-SM 
was labelled with [ 3 H]-choline. 

Big. 7- Human SMS 1 and SMS2 exhibit reverse activity. Detergent extracts of 
yeast cells expressing human SMS 1 , SMS2 or transformed with empty vector (control) 
were incubated with NBD-diacylglycerol (NBD-DAG; 1.8 yM) in the presence or 
absence of different head group donors (22 pM), as indicated. Formation of NBD-PC 
was monitored by one-dimensional TLC and quantified as described in Materials and 
methods. 

Big- 8. Conserved sequence motifs in SMS1, SMS2 and SMSr proteins. (A) 
Alignment of human SMS I, SMS2 and SMSr amino acid sequences. Identical residues 
are highlighted in black and conservative amino acid substitutions in grey. Conserved 
residues within four homology motifs, designated Dl to D4, are highlighted in blue and 
conservative amino acid substitutions in the D2 and D4 motifs of SMSr proteins in 
green. Note that motifs D3 and D4 display similarity to the C2 and C3 domains in LPPs, 
with identical residues highlighted in red. Regions predicted to form transmembrane 
domains, TM1 to TM6, are marked by a black line. (B) Alignment of the four homology 
motifs in SMS and SMSr proteins from humans (Hu), C elegans (Ce), P. falciparum 
(Pfl and D. melanogaster {Dm). PlasmoDB accession no. of PySMSl and P/8MS2 are 
MAL6P1.177 and MAL6P1.178, respectively. Putative active site residues in consensus 
sequences are underlined. Symbols used are n, small neutral amino acid; aromatic 
amino acid; b, branched amino acid; x, any amino acid. 
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Kg. 9. Sequence listings 1-11: conserved sequence motifs of 
phosphotransferases as indicated in figure 8 B 

Sequence listings 12 - 22: polypeptide sequences of cholinephosphotransferases and 
5 ethanolaminephosphotransferases 

12. human sphingomyelin synthase 1 

13. human sphingomyelin synthase 2 

14. human sphingomyelin synthase related 

10 15. Caenorhabditis elegans sphingomyelin synthase 1 

16. Caenorhabditis elegans sphingomyelin synthase 2 

17. Caenorhabditis elegans sphingomyelin synthase 3 

18. Caenorhabditis elegans sphingomyelin synthase distantly related 

1 9. Caenorhabditis elegans sphingomyelin synthase related 
Plasmodiumfalciparum sphingomyelin synthase 1 

21. Plasmodium falciparum sphingomyelin synthase 2 

22. Drosophila melanogaster sphingomyelin synthase related. 



15 20. 
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Kg. 10. Human SMS 1 and SMS2 are encoded by ubiquitously expressed genes. 
Northern blot analysis of SMS1 and SMS2 transcripts (arrows) in various human 
tissues. Random prime-labelled human SMS1 or SMS2 cDNA was hybridised to a 
human poly(A)- RNA blot (Origene. Rockville. MD). As a control for loading, the RNA 
blot was stripped and rehybridized with a human 0-actin cDNA probe. Mobilities of 
RNA size markers are indicated. 

Kg. 11. Human SMS1 and SMS2 localise to different cellular organelles. (A) 
HeLa cells co-transfected with V5-tagged SMS 1 and myc-tagged sialyltransferase were 
incubated in the presence or absence of 10 uM nocodazole for 60 min at 37*C, fixed, 
and then co-stained with rabbit anti-V5 and mouse anti-myc antibodies. Counterstaining 
was with FTTC-conjugated goat anti-rabbit and Texas unconjugated goat anti-mouse 
antibodies. (B) HeLa cells transfected with V5-tagged SMS2 were biotinylated on ice, 
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fixed and then co-stained with mouse anti-V5 and rabbit anti-biotin antibodies. 
Counterstaining was with Texas red-conjugated goat-anti mouse and FUG-conjugated 
goat anti-rabbit antibodies. Bar, 10 pm. 

5 Fig.12. Human SMS2 is partially associated with the Golgi. HeLa cells co- 

transfected with V5-tagged SMS2 and myc-tagged sialyltransferase were incubated in 
the presence or absence of 10 raM nocodazole for 60 min at 37°C, fixed, and then co- 
stained with rabbit anti-V5 and mouse anti-myc antibodies. Counterstaining was with 
FTTC-conjugated goat anti-rabbit and Texas red-conjugated goat anti-mouse antibodies. 

10 Bar, 10 mm. 

Fig. 13. Membrane topology of human SMS1 and SMS2 proteins. (A) Schematic 
view of the predicted membrane topologies of V5-tagged SMS1 and SMS2, and the 
Golgi-associated type I membrane protein, p24> (B) Immunoblot of intact or lysed HeLa 
15 cells expressing V5-tagged SMS 1 or SMS2 and pretreated with 8 inM trypsin for 30 

min at 30°C in the presence or absence of 0.4% Triton X1G0, as indicated. Immunoblots 
were stained with antibodies against the V5 epitope (a-V5) or against p24 (a-p24). 
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Table: EST clones and primers used in this study 
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Protein name 
HuCSSlal 



Accession 
number 1 

Q9NXE2 



EST clone ID Primer pairs N- and C-terminal {5'-3f 



IRAKp961F1432 2 



HuCSSiaZ 

HuCSSla 

HuCSSSal 
(SMSJ) 

HuCSS3a2 
(SMS2) 

MwCSSloa 

MmCSS2<X 

MmCSS3al 
(SMS1) 

CeCSSly 

CeCSS3al 
(SMSJ) 

CeCSS3a2 
(SMS2) 

CeCSS3p 



Q96GM1 


IRALo962PnS9 & 


096SS7 






J-AJT1NI/J oV 1 


Q8NHU3 


IRAKp961F1433 2 


Q96LT4 


IMAGp958I131239 
Q 2 


Q8VCY8 


IRAKp961L1531* 


Q91WB2 


IMAGp998B08108 




12 2 


Q8VCQ6 


IRAKp961L2141* 


Q22250 


YK289g7 




/YK572h5 4 


Q9XTV2 


YK559h9/ 




YKSHelO 4 


Q20735 


YK428e6 / 




YK109h8 4 


Q20696 


YK524b8 4 



AAGCCATGGCTGTAGGAAACAACAC 

GGTAACTTCGGTCATGGAC 

AAGCCATGGCGGG AG GGAGA 

GGTGGCCACGGCGGGC 

AAGCCATGCCAGCTTCCCAGAG 

CCAGGCAGAGATGAGGATC 

AAGCGATGATCCTTGTAGGACTCTG 

TGTGTCATTCACCAGCCGG 

AAGCCATGAATATGTTTGATGCTGACA 

ATCCTTATAAGCCCGTGTGG 

AAGCCATGGCAGGTCCTAATCAACTC 

TCCAATTAGTCTTTTCAlTATrGC 

AAGCCATGGCTGGAGGGAG 

GGTGGCCACGGCGGGC 

AAGCCATGCCAGCTTCCCAGA 

CCAGGCAGAGATGAGCATC 

AAGCCATGAAGGAAGTGGTTTAC 

TGTGTCGTTTACCAGCCGG 

AAGCCATGTCCGTGCCAGCTTCG 

GTACCGATATCCATCATTTTG 

AAGCCATGAAAATGTCTTGGAATCATC 

AA TTTTGGCAGAGACATGGTAG 

AAGCCATGACAAACAGTTCGGAGTTC 

TTGCAATTTGTAGTTGATACGA 

AAGCCATGCTG G ATAAC AG ACCTATAC 

ATTGTGCTTTTTGGTATGATTTTTr; 



1 Swiss-Prot/TrEMBL 

2 Obtained from RZPD Deutsches Ressourcenzentrum fflr Genomforschung, Berlin Germany 

a 5°^ ^ £ ^ Umi ° SUgan °' ***** ofMedical S ^ence, University KE iST 
4 Provided by Dr. Yuji Kohara. National Institute of Genetics, Mishinia Japan 
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To achieve the targets of the invention a complementary approach was pursued 
for the identification of animal SM synthase that takes advantage of structural 
information available for enzymes catalysing analogous reactions. In contrast to most 
5 animal cells, plants, fungi and yeast do act produce SM. Instead, these organisms add 
phosphoinositol to phytoceramide to generate inositol phosphorylceramide, or IPC 
(Dickson, 1998). IPC production in the yeast Saccharomyces cerevisiae requires the 
product of the AUR1 gene (Nagiec et al., 1997). 

Sequence analysis of Aurlp proteins from different fungi revealed four 
10 conserved motifs (Heidler and Radding, 2000) two of which are similar to the C2 and 
C3 domains present in members of the lipid phosphate phosphatase (LPP) family 
(Waggoner etaL, 1999). 

LPPs play a critical role in cell signalling by controlling the conversion of 
bioactive lipid phosphate esters such as (lyso)phosphatidic acid and sphingosine-1- 
15 phosphate to their dephosphorylated counterparts. The conserved CI, C2 and C3 

domains in LPPs likely constitute the active site for cleavage of the bond between the 
lipid hydroxyl and phosphate groups (Neuwald, 1997). In the case of Aurlp, this 
reaction would represent the first step in the transfer of inositol phosphate from PI, with 
the resulting enzyme-phosphate intermediate being subjected to nucleophilic attack by 
20 the oxygen of ceramide rather than the oxygen of water used by LPPs. The presence of 
LPP-like motifs, together with the IPC synthase activity found associated with affinity- 
purified Aurlp, suggests that Aurlp is directly responsible for IPC synthesis. 

Insects and mammals contain an ethanolamine phosphorylceramide (EPC) 
synthase, another phosphotransferase, that may be structurally related to SM synthase. 
25 The insect Drosophila melanogaster (Dm) does not generate SM, but instead produces 
phosphoethanolamine phosphorylceramide (EPC (Rietveld et al., (17). Moreover, it has 
been shown that the endoplasmic reticulum of mammalian cells harbours an EPC 
synthase (Malgat et al., 1987). Furthermore, the nematode Caenorhabditis elegans (Ce) 
generates phosphorylcholine-substituted glycolipids (Lochnit, G., et al., 2000) still 
30 another phosphotransferase, that may be structurally related to SM synthase. 

Based on the above considerations a bioinformatics and functional cloning 
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strategy was developed to identify the enzyme responsible for SM synthesis in animals 
and to identify other cholinephosphotransferases. The present invention provides a 
surprisingly successful strategy which comprises a process to isolate candidates for 
functional genes of a previously unidentified enzyme with known activity from a huge 
database by combining at least four characteristics based on data from bio-informatics 
and from biochemistry, viz. 

- presence of a sequence motif shared with previously identified enzymes having a 
related function 

- biochemical function of the gene should be unknown until now 

- no structural homologues in an organism that does not contain the enzyme 

- abffity to mediate a reaction catalysed by the unidentified enzyme upon its 
heterologous expression n a organism or cell lacking said enzyme activity. 
Preferably also the presence or non-presence of transmembrane domains depending on 
the working mechanism of the enzyme in relation to the membrane is considered. 

By searching the human, mouse, C. elegans and Drosophila protein databases 
using a sequence motif shared by LPPs and Aurlp homologues and further selection on 
the basis of the three additional criteria mentioned above and, a set of cholinephospho 
transferase sequences was collected as depicted in Fig. 8B. Conserved residues within a 
lot of homology motifs have been found. 

Therefore, the present invention further provides an isolated polypeptide 
comprising one or more of the amino acid motifs selected from the group consisting of a 
sequence of at least 80%, preferably 90 % identity to any of 

(a)P-L-X-D-X(35,75)-R-R-X(8)-[YFl-X(2)-R-X(6>T 



. .. (b) C-X-D-X(3)-S-G-H-T 

(c)H-Y-rrS]-X-D-tVIl-X(3)-[FYr|-X(6)-F-X(2)-Y-H. 
30 The wording isolated means that said polypeptide has been identified in a 



18/12 2003 17:39 FAX 



•* EPO-NL ©021 

021 18.12.2003 17:38:02 



-13- 

database comprising a large amount of polypeptides. 

In particular said polypeptide is derived from the group consisting of Animalia, 
Alveolata and Kinetoplastida. Preferably, said polypeptide is selected from the group 
comprising any of the SEQ ID No. 1 - 1 1 or at least 70%, more preferably at least 80%, 
5 still more preferably at least 90% similarity to the conserved amino acid motifs listed 
therein. 

Furthermore the present invention provides a polypeptide comprising an amino 
acid sequence with at least 70%, preferably at least 80% and still more preferably at 
least 90% similarity to any of the SEQ ID No. 12 - 22. Each of these sequences 
10 comprise amino acid motifs of the SEQ ID No. 1 - 1 1 or sequences related thereto so 
that the similarity grade of at least 70 % to any of the SEQ ID No. 12 - 22 will be 
reached. 

The present invention also provides a polypeptide comprising an amino acid 
sequence with at least 20 % identity to any of the SEQ ID No. 12 - 22. In particular, a 
is polypeptide comprising an amino acid sequence with at least 30 % identity to SEQ ID 
No. 12 and therefore at least also including SEQ ID No. 13. 

Furthermore, a polypeptide is provided by the present invention comprising an 
amino acid sequence with at least 40 % identity to any of the SEQ ID No. 19-21. 
Finally, the invention provides a polypeptide comprising an amino acid sequence with at 
20 least 22 % identity to SEQ ID No. 22. 

According to a preferred embodiment of the present invention a polypeptide is 
provided with one or more of the activities selected from the group consisting of 
cholinephosphotransferases and ethanolaminephosphotransferases, preferably one or 
more of the activities selected from the group consisting of sphingomyelin synthase, 
25 ethanolamine phosphorylceramide synthase, phosphatidylcholinerglycoprotein 
cholinephosphotransferase and phosphatidylcholine:glycolipid 

cholinephosphotransferase. Furthermore, a nucleotide sequence selected from the group 
consisting of a nucleotide sequence coding for any of the amino acid sequences of a 
polypeptide with one or more of said activities and an anti sense nucleotide sequence 
30 that is complementary thereto is provided, together with a plasmid, a vector and an 
organism or micro organism and a cell line comprising said nucleotide sequences. 
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According to a further preferred embodiment of the present invention a proce 
is provided for producing sphingomyelin synthase or sphingomyelin comprising the 
expression of the nucleotide sequences coding for a polypeptide with sphingomyelin 
synthase activity in a (microorganism or cell line comprising the nucleotide sequence 
coding for such a polypeptide. Furthermore, the use of one of more of the nucleotide 
sequences coding for a polypeptide with sphingomyelin synthase activity to influence 
the reaction 



10 



CER + PC^SM + DAG 



in vivo or in vitro has been provided. Li this way a powerful means has been created to 
enhance the conversion of CER and PC into SM and DAG and vice versa. Besides this, 
the present invention also provides the use of one of more of the nucleotide sequences ' 
coding for a polypeptide with sphingomyelin synthase activity to identify or 
15 develop compounds influencing the reaction 



CER + PC**SM + DAG. 



According to still a further preferred embodiment of the present invention a 
process is provided for producing ethanolamine phosphorylceramide synthase or 
emanolamine phosphorylceramide comprising the expression of the nucleotide 
sequences coding for a polypeptide with ethanolamine phosphorylceramide synthase 
activity in a (microorganism or cell line comprising the nucleotide sequences coding for 
such a polypeptide. Furthermore, the use of one of more of the nucleotide sequences 
coding for a polypeptide with emanolamine phosphorylceramide activity to influence 
the reaction 



CER + PE ^ EPC + DAG 



30 



in vivo or in vitro has been provided. In this way a powerful means has been created to 
enhance the conversion of CER and PE into EPC and DAG and vice versa. Besides this, 
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the present invention also provides the use of one of more of the nucleotide sequences 
coding for a polypeptide with ethanolamine phosphorylceramide synthase activity to 
identify or develop compounds influencing the reaction 

5 CER + PE^EPC + DAG. 

According to another preferred embodiment of the invention the application of 
the compounds mentioned above which can be identified or developed by the use of one 
of more of the nucleotide sequences coding for a polypeptide with sphingomyelin 
10 synthase and/or ethanolamine phosphorylceramide synthase in medical use, especially 
for the manufacture of medicaments treating a disease selected from the group 
consisting of cancer, metabolic diseases, for instance the Niemann Pick A disease and 
diseases caused by parasites, for instance malaria, is provided. 

According to a further preferred embodiment of the present invention a process 
15 is provided for producing phosphatidyhglycoprotein cholinephosphotransferase or 
phosphatidyl:glycolipid cholinephosphotransferase, phosphorylcholine-substituted 
glycoprotein or phosphorylcholine-substituted glycolipid comprising the expression of 
the nucleotide sequences coding for a polypeptide with phosphatidyl:glycoprotein 
cholinephosphotransferase or phosphatidyhgiycolipid cholinephosphotransferase 
20 activity in a (micro)organism or cell line comprising the nucleotide sequences coding for 
the corresponding polypeptide. "Corresponding" stands for the nucleotide sequences 
which code for the enzyme with the corresponding activity, so either for the enzyme 
with the phosphatidyhgjycoprotein cholinephosphotransferase activity or for the enzyme 
with the phosphatidyhgiycolipid cholinephosphotransferase activity. Furthermore, the 
25 use of one of more of the nucleotide sequences coding for a polypeptide with 

phosphatidyl:glycoprotein cholinephosphotransferase or phosphatidyhgiycolipid 
cholinephosphotransferase activity to influence the reaction 



30 



glycolipid/protein + PC PC-substituted glycolipid/protein + DAG 
in vivo or in vitro has been provided. In this way a powerful means has been created to 
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enhance the conversion of glycolipid/protein and PC into PC-substituted 
glycolipid/protein and DAG and vice versa. 

Besides this, the present invention also provides the use of one of more of the 
nucleotide sequences coding for a polypeptide with phospbatidyl:glycoprotein 
cholinephosphotransferase activity or phosphatidyl.-glycolipid 
cholinephosphotransferase activity to identify or develop compounds influencing the 



reaction 



10 



20 



25 



30 



glycolipid/protein + PC ~ PC-substituted glycohpid/protein + DAG. 



According to another preferred embodiment of (he invention the application of 
the compounds mentioned above which can be identified or developed by the use of one 
of more of the nucleotide sequences coding for a polypeptide with 
phosphatidylrglycoprotein cholinephosphotransferase or phosphatidyhglycolipid 
15 cholinephosphotransferase in medical use. especially for the manufacture of a 
medicament treating a disease caused by parasitic nematodes, is provided. 

A detailed description of the various aspects of the present invention will be 
given herebelow. 

A novel family of integral membrane proteins likely responsible for SM 
synthesis in animal ceUs was identified according to the process of the present 
invention. The two human members of this family, SMS1 and SMS2, were subjected to 
a detailed biochemical analysis and it is believed that their functional assignment as SM 
synthases is justified on the following grounds. 

First, heterologous expression of SMS1 or SMS2 proves sufficient to support 
SM synthesis in the yeast S. cerevisiae, an organism lacking endogenous SM synthase 
activity (figures 2, 4 and 6). Second, in keeping with the enzymatic properties of 
mammalian SM synthase reported in the literature, SMS1 and SMS2 function as bi- 
directional lipid cholinephosphotransferases capable of converting PC and ceramide to 
SM and DAG and vice versa (figures 6 and 7). Third, SMS1 and SMS2 localize to 
cellular organelles previously established as the principal sites of SM synthesis, namely 
the Golgi (SMS1) and the plasma membrane (SMS2), figure 1 1. Fourth, SMS1 and 
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SMS2 share sequence motifs containing putative active site residues with lipid 
phosphate phosphatases and candidate PLceramide inositolphosphotransferases (figure 
8), Finally, SMS 1 and SMS2 adopt a membrane topology whereby the putative active 
site residues are facing the exoplasmic leaflet (figure 13), hence the side of the 
5 membrane where SM synthesis was found to occur. 

A multiplicity of SM synthase genes appears widely spread among organisms 
generating SM. Human SMS1 and SMS2 are highly conserved in mammals and 
evidence was found for the existence of up to three functional homologues in the 
nematode C elegans. Moreover, two orthologous sequences in the human malaria 
10 parasite P. falciparum were identified according to the process of the present invention, 
suggesting that this lower eukaryote too contains different SM synthases encoded by 
separate genes. SM synthesis in P. falciparum occurs in the Golgi apparatus as well as 
in the network of tubovesicular membranes (TVM) emerging from the parasitophorous 
vacuolar membrane during intraerythrocytic development (Elmendorf and Haldar, 
15 1994). A differential sensitivity to l-phenyl-2-acylamino-3-morpholino- 1-propanoi has 
been taken as evidence that the Golgi- and TVM-associated activities correspond to 
different enzymes (Kauer et al., 1995). Our current findings provide a novel opportunity 
to test this prediction and may facilitate the development of new drugs against parasites 
as for instance malaria. 

20 The uniform tissue distributions of transcripts for SMS 1 and SMS2 in humans 

(figure 10) suggest that most mammalian cell types would contain both SM synthase 
isoforms. Biochemical analysis of human SMS1 and SMS2 thus far revealed no 
fundamental differences in enzymatic properties, even though the possibility that 
substrate preferences are lost when enzymes are assayed in a heterologous, detergent- 

25 containing system cannot be excluded. For now, the most striking difference between 
SMS1 and SMS2 concerns their subcellular localisation. While SMS1 seems to 
represent the well-known Golgi-associated SM synthase, SMS2 primarily resides at the 
plasma membrane. Hence, SMS1 would be proximal to SMS2 with respect to receiving 
newly synthesized ceramide from the ER. A challenging prospect is that, whereas the 

30 Golgi enzyme would be responsible for generating the bulk of cellular SM, the second 
enzyme may serve a principal role in signal transduction at the plasma membrane. The 
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15 



activation of a sphingomyelinase (SMase) and subsequent liberation of ceramide at the 
plasma membrane has been recognised as an important signalling event in the regulation 
of fundamental cellular processes that include cell cycle arrest, differentiation and 
apoptosis (Kolesnic and Hannun, 1999; Pettus et al., 2002; Andrieu-Abadie and Levade, 
5 2002). By converting ceramide back to SM, plasma membrane-associated SM synthase ' 
may attenuate SMase-induced signalling. This reaction would produce as a side product 
DAG, which is a signalling molecule in its own right (Brose and Rosenmund, 2002). 
Hence, the presence of a SM synthase at the plasma membrane complicates a proper 
understanding of the role of ceramide as a specific signal-transducer. 
10 A current bottleneck for elucidating the role of SM hydrolysis in cellular 

signalling is the unambiguous identification of the agonist-stimulated SMase. Both 
neutral and acid SMases have been implicated, but their precise cellular roles in SM 
hydrolysis and signalling remain to be clarified (Andrieu-Abadie and Levade, 2002). 
Since the plasma membrane-associated SM synthase is capable of catalysing the reverse 
reaction of SM synthesis (van Helvoort et al., 1994), it is tempting to speculate that it 
may execute part of the signalling events currently attributed to tigand-induced SMase. 
The observation that both plasma membrane- and Golgi-associated SM synthases 
exhibit reverse activity brings up the question of how the directionality of these enzymes 
is regulated. One possibility is that the direction of the reaction is primarily determined 
20 . by^.relativewnce^^ 

hand, given the biological importance attributed to DAG and ceramide, one may expect 
the interconversion of these molecules to be subjected to a more elaborate form of 
control. Indeed, it has been shown that activation of SM synthesis in primary astrocytes 
is an early event associated with the mitogenic activity of basic fibroblast growth factor 
25 CRiboni et al.. 2001) and that the onset of TNFa-induced apoptosis in 

rhabdomyosarcoma cells is preceded by TNF-dependent inhibition of SM synthesis 
(Bourteele et al., 1998). Mammalian SMS1 contains a predicted SAM (sterile alpha 
motif) domain at the amino-terminus (residues 7-63 in human SMS1) that might provide 
ameans for the enzyme to interact with regulatory proteins (Schultz et al., 1997). The 
30 present identification of a family of animal SM synthases offers unprecedented 

opportunities to further clarify the biological significance of SM metabolism and its 
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regulation. 

SM synthases contain four highly conserved sequence motifs (figure 8). Two of 
these, D3 and D4, are similar to the C2 and C3 phosphatase domains in LPPs and 
include the putative catalytic histidine and aspartate residues implicated in LPP- 
mediated hydrolysis of lipid phosphate esters (Neuwald, 1997) This suggests a working 
mechanism for SM synthases in which the catalytic triad previously described for LPPs 
would function in a similar fashion. The reaction would proceed via the following steps: 
(1) binding of a two-chain choline-phospholipid (PC or SM) to a unique binding site, (2) 
nucleophilic attack on the lipid-phosphate ester bond by the histidine in D4 assisted by 
the conserved aspartate in this motif, (3) formation of a cholinephospho-histidine 
intermediate and release of DAG or ceramide, facilitated by the histidine in D3 acting as 
a base, (4) nucleophilic attack of the Cl-hydroxyl of ceramide or DAG on the 
cholinephospho-histidine intermediate, assisted by the histidine in D3, (5) release of SM 
or PC from the active site to allow another round of catalysis. This model provides a 
framework for site-directed mutagenesis and kinetic studies to reveal the working 
mechanism of SM synthases. 

There is a striking parallel between animal SM synthase and IPC synthase in 
yeast like human SMS1, the yeast enzyme is Golgi localised and contains the key D4 
motif in the Golgi lumen at the start of the last membrane span (Levine et al M 2000). 
This suggests that animal SM synthase and yeast IPC synthase may have evolved from a 
common ancestor. In contrast, the SM synthase recently identified from Pseudomonas 
aeruginosa (Luberto et al, 2003) lacks the key motifs of animal SM synthase and is a 
soluble, rather than an integral membrane protein. Hence, it appears that SM synthases 
arose at least twice during evolution. 

Database searches using the catalytic site sequences from LPPs yielded, in 
addition to SM synthases, numerous other proteins containing partially conserved 
phosphatase motifs. In many cases, the biochemical functions of these proteins are not 
known. 

Given the present findings, the possibility that some of these LPP-like proteins 
do not serve as phospholipases but instead catalyse novel kinds of synthetic or 
transphosphatidylation reactions deserves consideration. For example, mammals and 
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insects synthesize ethanolamine phosphoiylceramide, a sphingolipid analogous to SM 
(Malgat et al., 1986; Rietveld et al.. 1999). Whether production of EPC and SM 
involves a single transferase that accepts both PC and PE as substrates or requires two 
separate enzymes has remained an open issue. However, Drosophila lacks SM and 
generates only EPC (Rietveld et al., 1999). Our finding that Drosophila contains an 
SMS-related protein, but no SMS proteins, strongly suggests that the SMS-related 
proteins described in this study function as dedicated EPC synthases. This possibility is 
currently under further investigation. 

Phosphorylcholine (PC), a small haptenic molecule, is found in a wide variety of 
prokaryotic organisms, i. e. bacteria, and in eukaryotic parasites such as nematodes, as 
well as in fungi. Linked to parasite-specific glycoprotein glycans or glycolipids, it is 
assumed to be responsible for a variely of immunological effects, including invasion 
mechanisms and long-term persistence of parasites within the host. Numerous reports 
have indicated various effects of PC-substituted molecules derived from parasitic 
nematodes on signal transduction pathways in B and T lymphocytes, displaying a highly 
adapted and profound modulation of the immune system by these parasites. 

The Nematoda, comprising parasitic and free-living species (e.g. C. elegans), can 
be regarded as promising prototypic systems for structural analyses, immunological 
studies and biosynthetic investigations. In this context, Ascaris suum, the pig parasitic 
20 nematode, is an ideal organism for immunological studies and an excellent source for 
obtaining large amounts of PC-substituted (macro)molecules. C. elegans, as a 
completely genome-sequenced species and expressing parasite analogous PC-substituted 
structures, together with the possibility for easy in vitro cultivation, represents a 
conceptual model for biosynthetic studies, whereas filarial parasites represent important 
model systems for human pathogens, especially in developing countries. Ccurrent 
knowledge on the tissue-specific expression of PC epitopes, structural data of 
glycoprotein glycans and glycosphingolipids bearing this substituent and biological 
implications for the immune systems of the respective hosts are reviewed by Lochnit et 
al., 2000. Since the elucidated structures of these compounds are clearly different from 
30 those present in the host, the biosynthesis of nematode-specific, PC-substituted 

compounds would be a promising target for the development of new anthelminthic 



25 
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strategies. 

It will be clear to the skilled in the art that by the present invention yielding 
purified cholinephosphotransferases and ethanolaminephosphotransferases a new field 
of development for new compounds with valuable properties as for instance drugs has 
been made available. The present invention will further be eludicated by the following 
examples without in any way restricting the broad scope thereof. 

Materials and methods 

Chemicals 

SphingosyI-(NBD-hexanoyl)-phosphocholine (NBD-SM) and (NBD-hexanoyl)- 
ceramide (NBD-Cer) were from Molecular Probes (Eugene, OR) and OleoyHNBD- 
hexanoyl>phosphocholine (NBD-PC) from Avanti Polar lipids (Alabaster, AL). 
[methyl- 14 C]-choline was fromlCN Biomedicals (Irvine, CA) andI^-3-phosphatidyl DST- 
methyl- 3 H]-choline,l,2-dipalaiitoyl from Amersham Pharmacia (Piscataway, NJ). NBD- 
DAG was produced from NBD-PC by treatment with phospholipase C from B. cereus as 
described (Trotter, 2000). Egg ^^phosphatidylcholine, lyso-phosphatidylcholine, L~rx- 
phosphatidylethanolamine, L^x-phosphatidylglycerol, I^a-phosphatidylseririe, soybean 
L-a-phosphatidylinositol, dioctanoyl L-a-phosphatidic acid, SM (from bovine brain), 
choline, phosphocholine, cytidine S^diphosphocholine, O-tricyclotS^.l^ldec-Q-yl 
dithiocarbonate potassium salt (D609), N-ethylmaleimide (NEM) and phospholipase C 
(Bacillus cereus) were from Sigma Aldrich (St Louis, MO). 



Selection, cloning and expression of CSS sequences 

Animal entries in the Swiss-Prot/TrEMBL protein sequence database were 
searched using Prosite rtittp://w ww .expasv.org/fonk/scanprositeA against sequence 
motif H-EYFWHJ-Xa-^IWl-Xa-tGAl-Xa-IGSTA] and then further selected based on 
three additional criteria as described under Results. Candidate SM synthases (CSS) were 
assembled into different groups by multiple sequence alignments and phylogenetic trees 
generated with ClustalW and PHYLJP. Open reading frames (ORFs) in selected CSS 
sequences were PCR amplified using Taq polymerase (MBI Fermentas, Hanover, MD) 
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according to infonnation provided in Table I PCR products were cloned into yeast 
expression vector pYES2. WS-ffis-TOPO 8 (Ihvitrogen Corporation, Carlsbad, CA> and 
the resulting plasmids used to transform S. cerevisiae strain IAY11 (MATa ura3-52 
his3-A2001eu2-3,-112trp-A901 ade2-101 ade3-A853, provided by Ian Adams, MRC- 
5 LMB, Cambridge, UK). Transformants were grown in complete minimal uracil dropout 
medium containing 2% (w/v) galactose. Expression of CSS proteins was verified by 
Western blot analysis with mouse anti-V5 antibodies (Ihvitrogen). PCR amplified ORFs 
of human SMS1 and SMS2 were cloned into mammalian expression vector 
pcDNA3. l/VS-His-TOPO 8 (Invitrogen) and the resulting plasmids (SMS1- 
10 V5/pcDNA3.1 and SMS2-V5/pcDNA3.1) used to transfect HeLa cells. 

SM synthase assay on cell lysates 

CSS-expressing yeast cells were lysed at 100 OD^ml by bead bashing at 4°C in 
15 RBI buffer (120 mM K-glutamate, 15 mM KC1, 5 mM NaCl, 2 mM MnClj, 2 mM 
MgCl 2 , 20 mM Hepes-KOH, pH 7.2) containing freshly added protease inhibitors 
(Holthuisetal.,(1998). 

Lysates were centrifuged at 700 g for 10 min at 4°C and supematants (0.25 ml 
per reaction) preincubated with NEM (l mM) for 10 min at 37°C. The reaction was 

20 initiated by addition of 5 nmol of NBD-Cerin 0.1 ml RBI buffer and incubation 

continued for 15 min. Lipids were extracted by addition of 1 .7 ml chloroform:methanol 
(1:2.2), dried under N 2 and subjected to butanol/water partitioning. Lipids recovered 
from the butanol phase were separated by one-dimensional TLC for 30 min in solvent h 
chloroform/methanol/4.2 M NH 4 OH (9:7:3, v/v), followed by 30 min in solvent It 

25 chlorofonn/methanoyacetic acid (45:30:7, v/v). Fluorescent images were recorded on a 
STORM 860 Imaging Analysis System (Molecular Dynamics, Sunnyvale, CA) and 
... analysed with hnageQuant software. _ 

Metabolic labelling 

30 CSS^xpressing yeast cells (0.5 OD^) were inoculated in 5 ml medium containing 10 
uCi [m^y/-» 4 C]-choline and 10 nmol NBD-Cer. Cells were grown for 16 h at 30*C, 
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washed in water, and lipids were extracted by bead bashing in 
HaO/methanol/chloroform (5:16:16, v/v). The organic extracts were dried under N 2 , 
subjected to butanol/water partitioning, and lipids recovered from the butanol phase 
were deacylatedby mild base treatment using 0.2 N NaOH in methanol (60 min, 30°C). 
After neutralizing with 1 M acetic acid, lipids were extracted with chloroform and 
separated by two-dimensional TLC using solvent I in the first, and solvent II in the 
second dimension. Radio-labelled lipids were detected by exposure to BAS-MS imaging 
screens (Fuji Photo Film Co., Japan) and read out on aBIO-RAD Personal Molecular 
Imager. 



Preparation of detergent extracts 

SMSl/SMS2-expressing yeast cells were lysed at 100 OD^vol by bead bashing 
15 at 4°C in lysis buffer (50 mM Tris-HCl, pH 7.0, 1 mM EDTA, 0.3 M sucrose) 

containing protease inhibitors. Post nuclear supernatants were prepared as above and 
loaded onto a 60% (w/w) sucrose cushion and then centrifaged at 100.000 g for 60 min 
at 4°C. Membranes derived from ± 500 OD m of cells were resuspended in 1 ml ice-cold 
extraction buffer (50 mM Tris-HCl, pH 7.0, 10% glycerol, 1% Triton X-100, 1 mM 
20 MnClz) containing protease inhibitors, incubated for 60 min at 4°C while rotating and • 
then centrifoged at 100.000 g for 60 min at 4°C. Detergent extracts were diluted to a 
final protein concentration of 1.5 mgfrnl in extraction buffer and 100 fll aliquots were 
snap frozen in liquid N 2 and stored at -80°C. 

25 SM synthase and reverse transferase assays on detergent extracts 

Detergent extracts were diluted 5- to 8-fold in extraction buffer, and 50 Ml mixed 
with 5 Hi of a 1.2 mM head group donor stock (e.g. CDP-choline, PC, PD or with 15 
jxCi (350-700 ftnol) of t 3 H]-choline-containing PC prepared in extraction buffer. 
30 Following a 15 min preincubation at 37°C, reactions were started by addition of 220 \H 
RB2 buffer (50 mM Tris-HCl, pH 7.0, 5 mM MgCl 2 , 1 mM MnCl 2 , 1 mM NEM) 
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containing 5 nmol NBD-ceramide (SM synthase assay) or 5 nmol NBD-DAG (reverse 
transferase assay) and incubated for 60 min at 37°C. Lipids were extracted, separated by 
TLC, and analysed as above. 

Mass spectrometry 

Detergent extracts (400 Ml) were mixed with bovine brain ceramide (1 80 pM) 
and egg PC (500 uM) in a total volume of 500 ul After 15 min at 37°C, reactions were 
diluted 4-fold in RB2 buffer and then incubated for another 60 min at 37°C. Lipids were 
extracted in chloroform:methanol (1:2.2) and subjected to butanoFwater partitioning. 
Residual Triton X-100 was removed by solid phase extraction on Si-60 columns (500 
mg stationary phase) in acetone. Phospholipids were eluted in methanol/chloroform 
(19:1, v/v), dried under N„ dissolved in methanol/chloroform (2:1, v/v) and then 
separated by HPLC on two 250 x 4.6 mm Lichrosphere RP-18 end-capped columns 
15 (Merck, Darmstadt, Germany) in series (Brouwers et al., 1998). The column effluent was 
split in a 10:1 ratio, with the smaller fraction going to the mass spectrometer. Mass 
spectrometry was performed on a Sciex API-365 triple quadrupole mass spectrometer 
(Applied Biosystems, Nieuwerkeric a/d Dssel, The Netherlands). Positive ions were 
generated by a turbo-ion spray ionisation source operating at +5.5 kV ionisation 
20 ... potential. N a (2 1/min) was used as drying gas at a.temperature of 350"C. The 

declustering potential (cone voltage) was set to 45V and the focus potential to 220V. 
Mass spectra from mass to charge ratio (m/z) 650 amu to 950 amu were recorded at a 
speed of 120 amu/s. 



25 



Mammalian cell transection and immunofluorescence microscopy 



. HeLa cells were cultured in DMEM medium containing 10% fetal calf serum. 

Cells were grown on glass coverslips to 40% confluence and then transfected with 
human SMSl-V5/pcDNA3.1, SMS2-V5/pcDNA3.1 and myc-tagged 
30 sialyltransferase/pCB7 constrncts using Lipofectamine 2000 Ohvitrogen). After 48 h, 
cells were fixed in 3% paraformaldehyde/PBS and processed for immunofluoresence 
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microscopy (see Supplementary materials for details). For cell surface biotinylation, 
cells were incubated twice with 0.5 rng/ml sulfo-NHS-SS-biotin (Pierce) in PBS for 20 
min on ice, quenched in 10 mM glycine/PBS for 20 xnin and then washed in PBS prior 
to fixation. Immunostaining was with rabbit anti-V5 antibodies (Sigma), mouse anti-V5 
5 antibodies (Ihvitrogen), mouse 9E10 anti-myc antibody (SanverTech, CA), rabbit anti- 
biotin antibodies (Rockland, Gilbertsville, PA), FTTC- or Texas Red-conjugated goat 
anti-rabbit and goat anti-mouse antibodies (Jackson Laboratories, West Grove, PA). 
Images were obtained using a Nikon D-eclipse CI confocal miscroscope. 

10 Protease protection assay 

HeLa cells transfected with human SMS1-V5 or SMS2-V5 constructs were 
washed, scraped and then lysed in ice-cold PNS buffer (120 mM K-glutamate, 15 mM 
KC1, 5 mM NaCl, 2 mM MnCl 2 , 0.8 mM CaC! 2 , 2 mM MgCl 2 , 1.6 mM EDTA and 20 

15 mM Hepes-KOH, pH 7.2) by 20 passages trough a 26 3 / 4 gauge needle. The homogenate 
was treated with 8 mM trypsin (Sigma Aldrich) in the presence or absence of 0.4% 
Triton X-100 for 30 min at 37°C. Alternatively, cells were only washed and then 
trypsin-treated in PNS buffer as above. Homogenates and cell suspensions were 
transferred on ice, trypsin inhibitor (Sigma Aldrich) was added to 160 mM and samples 

20 were processed for Western blot analysis using mouse anti-V5 antibodies and rabbit 
anti-p24 protein (Gommel et aL, 1999, provided by B. J. Helms, Faculty of Veterinary 
Medicine, Utrecht). 

SDS-PAGE and Western Blot Analysis 

25 Protein extracts were prepared in SDS/Urea sample buffer (20 mM Tris-HCl, pH 

6.8, 1 mM EDTA, 4.5 M urea, 2.5% SDS, 0.01% bromophenol blue) containing fresh 
protease inhibitors (Holthuis et aL, (1998). Extracts were heated for 10 min at 50°C, 
centrifuged briefly at 14,000 g and then resolved by SDS-PAGE on 10% polyacrylamide 
mini-gels. For Western blotting, nitrocellulose transfers were blocked for 90 min in 

30 PBS, 5% Protifar (Nutricia, Zoetermeer, The Netherlands), 0.2% Tween-20 (Blotto). 
Primary antibody incubations were performed for 1 h in Blotto. Detection was with 
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hoxsheradish peroxidase-conjugated goat anti-rabbit or anti-mouse IgG, using enhanced 
chemiluminescence (Amersham Pharmacia Biotech). 



Immunofluorescence Microscopy 
5 HeLa cells were grown on coverslips, transfected with human SMS 1- 

V5/pcDNA3.1,SMS2-V5/pcDNA3.1 and myc-tagged sialyltransferase/pCB? (Spronget 
al., 2001). 48 h after transfection, cells were fixed in 3% paraformaldehyde/PBS for 30 
min at room temperature, quenched in PBS containing 50 mM NH 4 C1, and then blocked 
and permeabilized for 1 h in PBS containing 0.5% BSA, 0.1% saponine (blocking 
10 buffer). Cells were labelled with mixtures of primary antibodies (1:200) for 1 b in 

blocking buffer, washed for 45 min in blocking buffer with three buffer changes, and 
then incubated for 20 min with 10% goat serum in blocking buffer. For immnnostaining 
of intact cells, incubation with primary antibodies occurred before fixation and was 
performed for 1 h at 4°C in PBS containing 0.5% BSA. Cells were counterstained with 
fluorescently labelled secondary goat antibodies (1 :75) for 30 min in blocking buffer, 
washed in blocking buffer for 45 min with three buffer changes, and then rinsed briefly 
in PBS and water. Mounting was in Mowiol 4-88 (Calbiochem, La Jolla, CA) 
containing 2.5% i,4-diazabicycio[2.2.2.]octane (Sigma). 



15 
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Example 1 

Identification of candidate SM synthases from the animal database 

Figure 1 A shows an outline of the bioinformatics approach used to identify 

5 candidate sequences for SM synthases from the animal database. Candidate SM 

synthases were identified based on the following criteria: 1) presence of a sequence 
motif, H-tYFWH^-Xi-D^VIJn^-CGAl-Xa-LGSTA], shared by previously characterised 
LPPs and Aurlp homologues; 2) biochemical function should be unknown; 3) no 
structural homologues in S. cerevisiae, since this organism lacks SM; 4) presence of 

10 multiple (>2) transmembrane domains, since the enzyme mechanism is 

intramembranous and because LPPs and Aurlp proteins have 6 predicted membrane 
spans. Sequences conforming to all 4 criteria were subsequently used as queries in a 
BLAST search to track down homologous sequences that were missed in the initial 
search due to deviations in the LPP/Aurlp motif. This approach yielded 9 human, 9 

15 mouse and 9 C. elegans sequences that could be grouped into three major protein 

families, designated CSS1, CSS2 and CSS3 (for candidate SM synthase family 1-3; 
Figure IB, C). Except for the presence of a common sequence motif, CSS proteins 
displayed no significant sequence similarity to Aurlp proteins or to LPP family 
members with known biochemical functions. However, human CSS1(J1 is identical to 

20 PRG1 , a neuron-specific candidate phosphatidic acid phosphatase with a role in axon 
growth and regenerative sprouting (Brauer et al., 2003) 

Database accession numbers of CSS proteins are listed in the legend of Figure 1. 

Example 2 

25 A subset of CSS3 family members displays SM synthase activity 

To investigate whether any of the three CSS families indeed contained SM 
synthases, the open reading frames of human, mouse and C elegans members for which 
full-length cDNAs could be obtained were cloned into a yeast multicopy, GAL1 
30 promotor plasmid in frame with a carboxy-terminal V5 epitope. The resulting plasmids 
were used to transform wild-type yeast and the transformants were shifted to galactose- 
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containing medium to induce expression of recombinant proteins. Expression of 
proteins was verified by Western blot analysis using anti-V5 antibodies (Figure 2A). 
Thus, 13 of the 27 selected CSS sequences were expressed and analysed for SM 
synthase activity (marked by asterisks, Figure 1C). To this end, yeast cells expressing 
5 CSS protein were lysed and incubated with fluorescent C^NBD-ceramide (NBD-Cer), a 
known substrate of mammalian SM synthase. Incubations were performed in the 
presence of N-ethylmaleimide (NEM), a potent inhibitor of yeast-associated SMases 
that does not affect mammalian SM synthase (Figure 3). Reaction mixtures were next 
subjected to one-phase lipid extraction and the lipids separated by one-dimensional 
10 TLC. 

Figure 2B shows that in lysates of control cells NBD-Cer was converted 
exclusively into NBD-IPC. The same was true for lysates of cells expressing members 
of the CSS1 and CSS2 protein families. However, in lysates of cells expressing human 
CSS3al, human CSS3o2, C. elegans CSS3al or C. elegans CSS3a2, NBD-Cer was 

15 converted to a second product with an Rf value identical to that of NBD-SM. The 

identity of this product as NBD-SM was coimrmed by electrospray ionisation tandem 
mass spectrometry (ESI-MS/MS; data not shown). Moreover, metabolic labelling of 
cells expressing human CSS3ccl or CSS3oc2 with ["Q-chotine in the presence of NBD- 
Cer resulted in the production of radio-labelled NBD-SM (Figure 2C). When NBD-Cer 

20 . ..was omitted, we noticed that CSS3o2-expressingcells.generated small amounts of 

[ ,4 C]-choline-labeUed lipids that were absent in control cells and distinct from PC and 
lysoPC. In addition, these lipids were resistant to mild alkaline hydrolysis, suggesting 
that they represent phytoceramide-based SMs (phytoSM; Figure 2D). Importantly, ESI- 
MS revealed that bovine brain ceramides incubated with detergent extracts of CSS3o2- 

2S expressing cells, but not control cells, are converted to SM (Figure 4). Together, these 
results indicate that CSS3a proteins do not only use NBD-Cer, but also recognise 
- naturally occurring ceramides as substrates for SM synthesis.- 

Human CSS3al and CSS3oc2 share 57% sequence identity and are highly 
conserved in mammals (human - mouse > 90%). Based on the results presented herein, 

30 we propose to rename these proteins SMS1 and SMS2, respectively. C elegans CSS3al 
(ceSMSl) and CSS3a2 (ceSMS2) share 22-27% sequence identity with human SMS1 
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and SMS2, and apparently represent their functional counterparts in the nematode 
(Figure 2B). C elegans contains a third, CSS3a-reIated protein, termed ceCSS3ct3. 
Although we have not tested this protein for SM synthase activity, the finding implies 
that G elegans is equipped with three independent SM synthases (Figure 1C). BLAST 
5 searches for orthologous sequences identified two proteins in the human malaria parasite 

• Plasmodium falciparum (PlasmoDB accession no. MAL6P1.177 and MAL6P1.178; see 
also below). Hence, a multiplicity of SM synthase genes appears a general feature of 
organisms generating SM. 

Apart torn CSS3a/SMS proteins, the CSS3 family contains a second cluster of 
10 CSS3P or SMS-related (SMSr) proteins with members in humans, mice, C elegans, 
and Drosophila (Figure 1C). Human SMSr is highly conserved in mammals (human- 
mouse: 95%), shares > 40% sequence identity with its orthologues in C elegans and 
Drosophila, and is about 34% identical to human SMS 1 and SMS2. Heterologous 
expression of human or G elegans SMSr did not yield any detectable SM synthase 
15 activity (Figure 2B, right panel). The C elegans CSS3yprotein, renamed SMSdr, is only 
distantly related to SMS and SMSr proteins (<22% identical; see also Figure 1C) and its 
ability to mediate SM synthesis was not tested. 

Example 3 

20 Human SMS 1 and SMS2 function as PCxeramide cholinephosphotransferases with 

reverse activity 

• SM synthesis in animals proceeds by the liberation of phosphoxylcholine from PC and 
its subsequent transfer onto the primary hydroxyl of ceramide. To establish whether 

25 human SMS1 and SMS2 function as PCiceramide cholinephosphotransferases, their 
enzymatic characteristics were analysed in more detail. Hence, different donors of 
choline-P were tested as substrates for SM synthesis. To this end, detergent extracts of 
cells expressing human SMS1 or SMS2 were incubated with NBD-Cer and the 
formation of NBD-SM was monitored by TLC. A dilution of extracts proved necessary 

30 to render SMS-mediated synthesis of NBD-SM dependent on externally added head 
group donors (Figure 5). Under these conditions, free phosphorylcholine (Ch-P) and 
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CDP-choline (CDP-Ch) did not support SMS1- or SMS2-mediated SM synthesis 
(Figure 6A). PC, on the other hand, was efficiently recognised as substrate. SM itself 
was also used as a donor of the phosphorylcholine group. Lyso-PC was a very poor 
substrate. These results suggest that human SMS1 and SMS2 are transferases that 
require two fatty chains on the choline-P donor molecule in order to be recognised 
efficiently as a substrate. 



Example 4 

The reaction of human SMS 1 or SMS2 with non-choline phospholipids 

10 

We next investigated the possibility that human SMS1 or SMS2 would use non- 
choline phospholipids as substrates. None of the phospholipids tested (PE, PI, PS, PA, 
PG) other than PC supported SM formation (Figure 6A). Moreover, when extracts of 
cells expressing SMS2 were incubated with PC containing [^-choline, the formation 
15 of radiolabelled NBD-SM was observed (Figure 6B). Hence, it appears that SMS 1 and 
SMS2 directly and specifically recognise, the choline head group on their substrates. 

Example 5 

The reaction of sphingomyelin and diacyl glycerol 

20 

Previous work suggested that mammalian SM synthase is also capable of 
catalysing the reverse reaction, namely the formation of PC from SM and DAG 
(Marggraf and Kanfer, 1984; van Helyoort et'al., 1994). 

To investigate whether this Was also the case for human SMS1 and SMS2, 

25 extracts of cells expressing these proteins were incubated with NBD-DAG and the 

formation of NBD-PC monitored by TLC. Addition of SM induced NBD-PC formation 
in extracts of SMS1- or SMS2-expressing cells, but not in control cell extracts (Figure 
7). Strikingly, PC itself proved more efficient than SM in stimulating SMS1- or SMS2- 
dependent NBD-PC formation. In contrast, addition of non-choline phospholipids (e.g. 

30 PI) had no effect. These data suggest that human SMS 1 and SMS2, rather than 



18/12 2003 17:54 FAX 



-> EPO-NL 0016 

016 18.12-2003 17:48:23 



-31- 

functioning strictly as SM synthases, are transferases capable of using PC and SM as 
phosphocholine donors to produce PC or SM, dependent on the relative concentrations 
of DAG and ceramide as phosphocholine acceptors, respectively. 

Mammalian SM synthase was found to be sensitive to the bacterial PC- 
5 phospholipase C inhibitor, D609 (Luberto and Hannun, 1998). 

In lysates of SMS 1- or SMS2-expiessing yeast cells, D609 inhibited SM synthesis in a 
dose-dependent manner. The extent of inhibition observed for SMS 1 -mediated SM 
synthesis (50% at 100 mg/ml D609) was comparable to that reported for mammalian 
SM synthase (Luberto and Hannun, 1998). SMS2 proved two-fold less sensitive to the 
10 drug. 

Hence, human SMS1, SMS2 and the mammalian SM synthase activity described 
in the literature share many enzymatic characteristics. 

Example 6 

15 Structure and expression of human SMS 1 and SMS2 



Figure 8 A shows a sequence alignment of human SMS1 , SMS2 and SMSr. 
Hydrophobicity analysis predicted six membrane-spanning alpha helices connected by 
hydrophilic regions that would form extramembrane loops. A comparative analysis with 
SMS and SMSr sequences from mice, G elegans, P. falciparum and Drosophila 
revealed that the number and spacing properties of transmembrane helices are well 
conserved. In addition, SMS proteins contain four highly conserved sequence motifs, 
designated Dl, D2, D3 and D4 (Figure 8 A and B, residues highlighted in blue). Motifs 
D3 (C-G-D-X 3 -S-G-H-T) andD4 (H-Y-T-X-D-V-Xa-Y-Xe-F-Xa-Y-H) are similar to the 
C2 and C3 motifs in LPPs (shared residues highlighted in red) and include the histidine 
and aspartate residues (underlined) that form a catalytic triad mediating the nucleophilic 
attack on the lipid phosphate ester bond Neuwald (1997). 

As in LPPs, these residues are juxtaposed to transmembrane segments 4 and 6 of 
the SMS proteins and consequently would be oriented toward the same side of the * 
membrane. This would suggest tjiat motifs D3 and D4 are part of the catalytic site 
responsible for liberating cholinephosphate from PC during SM synthesis. Motifs Dl 
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(P-I.P-D) and D2 (R-R-X 8 .Y-X a -R.X 6 . T ), oa the other hand, appear entirely unique to 
SMS proteins and are located in the first extramembrane loop and third transmembrane 
helix, respectively (Figure 8A). The SMSr proteins found in humans, mouse, Drosophila 
and C. elegans each contain exact copies of the Dl and D3 motifs, yet exhibit one or 
5 more conserved amino acid substitutions in motifs D2 and D4 (Figure 8A and B, 
residues highlighted in white on grey. 

Previous work showed that some members of the mammalian LPP superfamily 
are expressed in only a limited set of tissues (Waggoner et al., 1999, Brauer et al, 1997). 
Since differences in tissue distribution would provide a possible explanation for the 

10 existence of two different SM synthase isoforms in mammals, we investigated the 

expression profiles of human SMS1 and SMS2. As shown in Figure 10, Northern blot 
analysis detected the presence of a low abundant 3.8 kb transcript for SMS1 in human 
brain, heart, kidney, liver, muscle and stomach. A major 1.9 kb transcript for SMS2 was 
expressed to a similar level in all of the above human tissues. These results suggest that 

15 human SMS1 and SMS2 ate encoded by ubiquitously expressed genes. 



Example 7 

Subcellular localisation and membrane topology 

20 SM synthesis occurs in the Golgi complex as weU as at the plasma membrane of 

inammalian cells. Endosomes have been put forward as another major site of SM 
synthesis (Kallen et al., 1994) but this has been disputed (van Helvoort et al., 1994). 

It is not known whether SM synthase activity detected at these locations is due to 
the presence of more than one iso-enzyme in the cell. This led us to examine the 

25 subcellular distribution of V5-tagged versions of human SMS1 and SMS2 in transfected 
HeLa cells using immunofluorescence microscopy. As shown in Figure 1 1 A, SMS 1 was 
concentrated in the perinuclear region where it displayed extensive colocalisation with 
sialyltransferase, a marker of trans Golgi cisternae. This colocalisation was also 
observed in cells treated with nocodazole, a drug causing fragmentation of the Golgi by 

30 disrupting the microtubular network, hence confirming the association of SMS1 with the 
Golgi apparatus. SMS2 displayed a different localisation pattern and was primarily 
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concentrated at the plasma membrane (Figure 1 IB). A portion of SMS2 was also found 
in the perinuclear region where it colocalised with sialyltransferase (see Figure 12). This 
Golgi-associated pool of SMS2 unlikely represents newly synthesized material en route 
to the cell surface, since it was also observed in cells after a 4-hour chase with 
5 cycloheximide. There was no substantial colocalisation of SMS2 with markers of the 
endosomal/lysosomal system (e.g. EEA1, CD63, internalised transferrin; data not 
shown). Whether SMS2 cycles between the Golgi and the plasma membrane, and 
thereby passes through endosomes remains to be established. 

Since SM synthesis takes place in the exoplasmic leaflet of the Golgi and the 
10 plasma membrane (Futerman et ah, 1990; van Helvoort et aL, 1994), the putative 

catalytic residues in motifs D3 and D4 of SMS 1 and SMS2 would be oriented toward 
the Golgi lumen and cell surface, respectively, whereas their termini would be located 
on the opposite, cytosolic side of the membrane (Figure 13A). To test this prediction, we 
investigated the sidedness of the V5-tagged COOH termini of human SMS1 and SMS2 
15 by protease protection analysis. When HeLa cells transfected with the SMS2- 

VSassociated type I membrane protein, p24 construct were trypsinised, the VS tag 
remained intact, unless cells were lysed or treated with detergent prior to incubation 
with the protease (Figure 13B). Trypsinisation of lysates from SMS1-V5 expressing 
cells in the absence of detergent resulted in a complete removal of the V5 tag. Under 
20 these conditions, the Golgi-associaied type I membrane protein, p24 (Gommel et aL, 
1999). 

was largely protected. Collectively, these results indicate that the COOH termini of 
SMS1 and SMS2 are cytosolic. Consequently, the putative active site residues in these 
proteins would be positioned on the exoplasmic leaflet (Figure 13 A), hence where SM 
25 synthesis was Found to occur. Together, these findings demonstrate yet another level of 
similarity between SMS1, SMS2 and the SM synthase activity previously described in 
mammalian cells. 



30 
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Claims 

1. An isolated polypeptide comprising one or more of the amino acid motifs 
selected from the group consisting of a sequence with at least 80% identity to any of 

5 

(a) P-I^X-D-X(35,75)-R-R-X(8)-[YF]-X(2)-R-X(6)-T 

(b) C-X-D-X(3)-S-G-H-T 

10 (c) H-Y-[TS]-X-D-[VI]-X(3)-[FYI]-X(6)-F-X(2)-Y-H. 

2. A polypeptide according to claim 1 that is derived from the group consisting of 
Animalia, Alveolate and Kinetoplastida. 

IS 3. A polypeptide according to claim 1 or 2 selected from the group comprising any 
of the SEQ ID No. 1 - 1 1 or at least 70% similarity thereto. 

4. A polypeptide according to claim 1 or 2 comprising an amino acid sequence 
with at least 70 % similarity to any of the SEQ ID No. 12 - 22. 

20 

5. A polypeptide according to any one of the previous claims comprising an amino 
acid sequence with at least 20 % identity to any of the SEQ ID No. 12 - 22. 

6. A polypeptide according to claim 5 comprising an amino acid sequence 
25 with at least 30 % identity to SEQ ID No. 12. 

7. _ A polypeptide according to claim 5 comprising an amino acid sequence with at 
least 40 % identity to any of the SEQ ID No. 19-21. 

30 8. A polypeptide according to any one of the claims 1 - 3 comprising an amino 
acid sequence with at least 22 % identity to SEQ ID No. 22. 
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9. A polypeptide according to any one of the claims 1-6 with sphingomyelin 
synthase activity. 

10. A polypeptide according to any one of the claims 1-5 and 7 with 
5 ethanoiamine phosphorylceramide synthase activity. 

11. A polypeptide according to any one of the claims 1-5 and 8 with one or more of 
the activities selected from the group consisting of phosphatidylcholinerglycoprotein 
cholinephosphotransferase and phosphatidylcholinerglycolipid 

10 cholinephosphotransferase. 

12. A nucleotide sequence selected from the group consisting of a nucleotide 
sequence coding for any of the amino acid sequences as described in claim 9 and an anti 
sense nucleotide sequence that is complementary thereto. 

15 

13. A nucleotide sequence selected from the group consisting of a nucleotide 
sequence coding for any of the amino acid sequences as described in claim 10 and an 
anti sense nucleotide sequence that is complementary thereto. 

20 14. A nucleotide sequence selected from the group consisting of a nucleotide 

sequence coding for any of the amino acid sequences as described in claim 11 and an 
anti sense nucleotide sequence that is complementary thereto. 

15. A plasmid comprising any of the nucleotide sequences described in any of the 
25 claims 12, 13 or 14. 

16. A vector comprising any of the nucleotide sequences described in any of 
the claims 12, 13 or 14. 

30 17. ■ A (micro)organism or cell line in which any of the nucleotide sequences 
described in claim 12 was introduced. 
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1 8. A (micro)organism or cell line in which any of the nucleotide sequences 
described in claim 13 was introduced 

19. A (microorganism or cell line in which any of the nucleotide sequences 
described in claim 14 was introduced. 

20. A process for producing sphingomyelin synthase comprising the expression of 
any one of the nucleotide sequences described in claim 12 in a (micro)organism or cell 
line of claim 17 and the isolation of sphingomyelin synthase. 

21. A process for producing sphingomyelin comprising the expression of the 
nucleotide sequences described in claim 12 in a (microorganism or cell of claim 17 and 
the isolation of sphingomyelin. 

22. Use of one of more of the nucleotide sequences of claim 12 to influence the 
reaction 

CER + PC^SM + DAG 

in vivo or in vitro. 

23. Use of one of more of the nucleotide sequences of claim 12 to identify or 
develop compounds influencing the reaction 

CER + PC^SM + DAG 

in vivo or in vitro. 



24. A process for producing ethanolamine phosphorylceramide synthase comprising 
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the expression of any one of the nucleotide sequences described in claim 13 in a 
(microorganism or cell line of claim 18 and the isolation of ethanolamine 
phosphorylceramide synthase. 

5 25. A process for producing ethanolamine phosphorylceramide comprising the 

expression of the nucleotide sequences described in claim 13 in a (micro)organism or 
cell line of claim 1 8 and the isolation of ethanolamine phosphorylceramide. 

26. Use of any one of the nucleotide sequences of claim 13 to influence the 
10 reaction 



CER + PE EPC + DAG 



in vivo or in vitro. 

15 



27. Use of any one of the nucleotide sequences of claim 13 to identify or 
develop compounds influencing the reaction 



CER + PE^EPC+DAG 

20 

in vivo or in vitro. 

28. The application of the compounds of claim 23 or 27 in medical use. 



29. The application of the compounds of claim 28 for the manufacture 

of medicaments treating a disease selected from the group consisting of cancer, 
metabolic diseases and diseases caused by parasites. 

30. A process for producing phosphatidyhglycoprotein 
cholinephosphotransferase or phosphatidyhglycolipid cholinephosphotransferase 
comprising the expression of any one of the corresponding nucleotide sequences 
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described in claim 14 in a (microorganism or a cell line of claim 19 and the isolation of 
phosphatidyl:g]ycoprotein cholinephosphotransferase or phosphatidyltglycolipid 
cholinepbosphotransferase. 

31. A process for producing phosphorylchohne-substituted glycoprotein or 
phosphorylcholine-substituted glycolipid comprising the expression of the 
corresponding nucleotide sequences described in claim 14 in a (microorganism or cell 
of claim 19 and the isolation of phosphorylcholine-substituted glycoprotein or 
phosphorylcholine-substituted glycolipid . 

32. Use of one of more of the nucleotide sequences of claim 14 to influence the 
reaction 

glyco lipid/protein + PC ^ PC-substituted glyco lipid/protein + DAG 
in vivo or in vitro. 

33. Use of one of more of the nucleotide sequences of claim 14 to identify or 
develop compounds influencing the reaction 

glyco lipid/protein + PC - PC-substituted glyco lipid/protein + DAG 
in vivo or in vitro. 



25 34. The application of the compounds of claim 33 in medical use. 

35. The application of the compounds of claim 34 for the manufacture 
of a medicament treating a disease caused by parasitic nematodes. 

30 36. A process to isolate candidates for 



15 



20 



i 
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enzyme with known activity from a huge database by combining at least four 
characteristics based on data from bio-informatics and from biochemistry, viz. 

- presence of a sequence motif shared with previously identified enzymes having a 
related function 

5 - biochemical function of the gene should be unknown until now 

- no structural homologues in an organism that does not contain the enzyme 

- ability to mediate a reaction catalysed by the unidentified enzyme upon its 
heterologous expression in an organism or cell lacking said enzyme activity. 

10 37. A process to isolate candidates for functional genes according to claim 36 
characterized by also considering the presence or non-presence of transmembrane 
domains depending on the working mechanism of the enzyme in relation to the 
membrane. 



20 



25 
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Abstract 

Newly identified cholinephosphotransferases and emanolarmnephosphotransferases 

The present invention relates to polypeptides comprising one or more of the 
5 amino acid motifs selected from the group consisting of a sequence with at least 80% 
identity to any of 

(a)P-L.X-D-X(35,75)-R.R-X(8)-rYF]-X(2)-R-X(6).T 

(b) C-X-D-X(3)-S-G-H-T 

(c) H-Y-rrS]-X-D-[VTI-X(3)-[FYII-X(6>F-X(2)-Y--H > 

10 which transfer phosphocholine and phosphoethanolamine, nucleotide sequences coding 
for any of these polypeptides and nucleotide sequences complementary thereto, 
plasmids, vectors and a (microorganism or cell comprising said nucleotide sequences. 
Furthermore, me present invention relates to processes to produce 
cholinephosphotransferases and ethanolaminephosphotransferases like sphingomyelin 

15 synthase, ethanolamine phosphorylceramide synthase, phosphatidylcholinerglycoprotein 
cholinephosphotransferase and phosphatidylcholine:glycolipid 
cholinephosphotransferase. The present invention also provides the application of said 
nucleotide sequences to influence the equilibrium reactions or to develop compounds 
influencing the equilibrium reactions wherein said transferases are involved and the 

20 application of said compounds in medical use. Finally, a process has been provided to 
isolate candidates for functional genes of a previously unidentified enzyme from a large 
database by isolating candidates for functional genes of a previously unidentified 
enzyme with known activity from a huge database by combining at least four 
characteristics based on data from bio-informatics and from biochemistry, viz. 

25 - presence of a sequence motif shared with previously identified enzymes having a 
related function 

- biochemical function of the gene should be unknown until now 

- no structural homologues in an organism that does not contain the enzyme 
" ability to mediate a reaction catalysed by the unidentified enzyme upon its 

30 heterologous expression in an organism or cell lacking said enzyme activity. 
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FIG. 9 

Sequence listings 

Sequence listings 1-11: conserved sequence motifs of phosphotransferases as indicated in 

figure 8 B; 

sequence listing 1: partial polypeptide sequences of hu SMS1 
sequence listing 2: partial polypeptide sequences of hu SMS2 
sequence listing 3: partial polypeptide sequences of ce SMS1 
sequence listing 4: partial polypeptide sequences of ce SMS2 
sequence listing 5 : partial polypeptide sequences of ce SMS3 
sequence listing 6: partial polypeptide sequences of pf SMS\ 
sequence listing 7: partial polypeptide sequences of ^ SMS2 
sequence listing 8: partial polypeptide sequences of hu SMSr 
sequence listing 9: partial polypeptide sequences of dm SMSr 
sequence listing 10: partial polypeptide sequences of ce SMSr 
sequence listing 11: partial polypeptide sequences of ce SMSdr 

Sequence listing 12: JimSMSI 

MKEVVYWSPKKVADWI^ 

GQRLLDMffiTLKMEHH^ 

EMKIPMPELERSQYPMEW 

TFFDHFNRVQWAFSICEIN 

YVTTLPWGMHFNCSPKLF 

MLTLTYU?IKEYSPR 

WWYHTMANQQVLKE^ 

VHLSRQVKYSRLVNDT 
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Sequence listing 13: huSMS2 

MDnETAKLEEF&ENQPSDPTOTYARPAEPV^ 
YIQIAMPTESRNKFPLEWWKTG3AFTYAVF 

FDYIDRVKWAFSVSElNGIILVGLWITQWLFLRYKSIVGPvPJCFnGTLYLYRCITMYVT 

TLPWGMHFQCAPKLNGDSQAKVQRmRLISGGGLSTO 

YLFIKEYSPRHFWWYFfLICWLLSAAGnCILVAHE 

NEHSlIJCVSSQTNFLSRAWWFPIFYFra 

VQKIGEDNEKST 



Sequence listing 14: ceSMSl 
MKMSW1SMQYTNTYGSIADDNGDEEKAENSEGAAA 

AS SRS SHHDKQKPGETKKSGD GKMDDDDnTTARSSSRRICGS AAS SSD S ETADD APL 

LPDEGPSHAVRLEMPGDKPASPHDRFPKTPLKT^^ 

YPLTPPLPDWFEUPKIPWGLRLCENI^ 

MRCITMMVTPWKADEDFECSPRFGENATFSLWMRGVWSMFGLGLNLFDNQKVV^ 
CGDYIYSGHTLVLVVSALFIGEYSPPJtFY^ 

S YF ACTRVF WAYHTQ AAHPSIRI.S VQNHQAKEF WFPLLRWFEGDIRRP VPRRFD CPIS 
YSQTPWYSFLPWFSFFFQSNPMMFPGAFTFYHTQNHSKSTMSLPK 



Sequence listing 15: ceSMS2 

MTNSSEFTDVLQSPJDPCVSNGrvO^ProPEPTPIRKEFTCEDTFH 

TAFLCLMLSAFLNFITXTVIHDVVPRQPLPDLTFMnPQQRWAW 

FTDFIJIHQRWIVIJRRTFIXGAIN/ry' GLRAVJLGVTFLPPSFHNRDEICQPQVFiTRTAMYG 

MEIATRFLTYYITLGLTSGQDKmCGDLMFSGHTVVLTFMYFVQLQYTPRGLV^ 

APITFLGIAALVVSGGHYTMDVIIAYWL^ 

WFWLCYWESDVADGKLVNKWbTWPI£^^ 
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Sequence listing 16: ceSMS3 

MGSVSKTVISARGASPDDEQNGTKNGISNGSEWAKCIFLFFFLFIAGMSNWAVLAYT 
HDYWRESLPDIWSLVSEQRWASSLGDFCVALCIVMLGALLVmQHRGTILKRWFC 
AGTLYAMRSVTLAATQLPSGYTDNQGRC1TOQ\^ 

MLCGDLLFSGmi.VMVTCSI^VAYYLPKSIKPLQWSHVACLIGMICMTISRTHYTID 

VVIAYWLSNMWRMYHAYCEVDMCMERRKSILYSWWPCRIVDWLE 

RCQLPWIIRSTPRGQERGGASAESSDSSVTMCDMTTSHHQKHVSISSSSTYPLPC 



Sequence listing 17: .r/SMSI 

MffiGNTNANTraQOTTNNKJEKNFIKEKDIIN^ 

DKHYNHKDSYSDNKGKKTLRSNFSDIQD^ 

KMLKIFIRRLVCATnGnGICIQCYFlILSDTYYKTGDEPLKD 

NGSIMFFLAITLLRFGLFCPLIXSrriLIRlILMLSFIYCIRSFFIW 

LVE^YIFYIJITAQVYECTDLVISGHTAF^ 

SRFHYTVDVLMGYWGGSWl^YHYLVDVAARRYALKrSWPQTYGFSGSFTDRFQ 

VFQYFlRArTYLEALDHFJVINFALSYDKEWNCFCSC 

FSDHFYHSYAGNGTYbtt^TTRNnKEFKRFFGMNKKN 



Sequence listing 18: p/SMS2 
MNKYEYFYSPIQLLHLILLLRPILYENYT^ 

S TMEEEIL S EYFXCKIIXIKLMF ALLFIXIALHQ GFFMIYSDS YYKSNTQPLSDRIHDLF 

GOTPKWISYKLSNTLIAILTLSFLKin.FNSIYLSIAnCRFLYWG 

LETCLPLESGhTFLFNLIXJEKINT^VYVCADLWS 

TF S CFTY"AniIGFIHYTSDVLLGnFGWMFSF YHIMLDIS 

KPFFIJIFFVARIFFKIIPYLEGLNYTLDY 

DKnnsTYSDHLYHSYAGDGTINFLFWKFLKTIKKGLHK 
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Sequence listing 19: huSMSr 

MAGPNQLCIRRWTlXHVAVWOCDEGFFBYVDIIX^IKHRLDGniXTLTEYDLRSPPL 
EIKVLGDIKRLMLSVIIKLQKIHIDVLEEMGYNSDSPMGSMTPFISALQSTDWLCNGEL 
SHDCDGPni)IJ^SDQYQYMNGKNKHSVRRIJ>PEYWKlTI^CIYVFIWGFTSFIMVIV 

CSlMGTVFlXRCFTMFVTSLSVPGQHLQCTGKIYGSVWEKLHRAFAIWSGFGIvfrLTG 
^^FSGHTVVLTMLOT^ 



Sequence listing 20: rfmSMSr 

MCDGEIGDPVTQPRSEGGGLVTMDQETRTHYLX>AATDKHLTNGSPDPEPVDPLLVA 
QWSIENVTSWATCMEHFSRTIXDCLRQEAHJGEVI-LSLTEEDVEDMRYKLGYKLIEG 

AECPSYVSDCDTYLRMOGRYVPPEWTAMSLGYSPVVTVm^LmviVHERVPDM 
KRYPPIPDIFIJ5KVPHQ J WAF1^CEITGSLIJ'TIWVVVLTFHKYRLVIXEUU'FALAGTV 
*LI-RCVTMIJTSLSVPG'IHLQCSQKDFA1DDPNVDMV QALnRMSRAYRIWSGLGMSI 

<^VRTCGDYMFSGHWALTLI^ITriEYTPR>^YFUiXLTWLI^MFGIFl f lIAAHEHY 

SIDVFVAFYTreRLFLYYHTLANNRALMQSDSKRTOVWFPMFSYFESSVDGMVPNEY 

DTLGSLIIXJnEQIFKAKDQLAISVKRCWLDAPLSGNSSAHMFGGESEQCLRNGTASA 

AFFSPHQSIJGGIXK3QRSQTHLNSAQSGSSTPTPATSAPTBISLPTQKKTFRDASVX)PFS 
RTTFAAVQQAENVKDSPMKEKKHL 



Sequence listing 21 : eeSMSr 

MU5NRPIPADPNEWRCEDVGNWIKOGMAKYADIJAMKHKVDGKCLLALTDTDIJC 
DPPVSINCLGDIKKnLFAIEFLSQKWEIGNSGVHHRSTPNGNGPSLKNSKDGLLVEYN 
EQNHLSISGEDVYTTTRRAEIVEDEETLLDTLAKSSDGTSTVQLISREEnRQVERPDTY 
FKSVAKIJJAFAYSSLSFLMTSFVMVLVHDRVPD'nCTYPPLPDlVLDNVPHIPWAFDM 
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CETIGLVIAVVWFTVLFFHNQRVrVARR^ 

ARPNTmQEKLHKAFHIWSNLGMSUIGVRSCGDYMFSGHTTVrM 
WTGLHTFTWVE^JCF AIFLILAAHEHYSID VFIAF YIS SRMFLYYHAYAYNHAG1TATD 
YRMRTWFPLGWFFEYGSQGKVENEFSLPINI^ 
WAAKQNGNSKNHTKKHN 



Sequence listing 22: ceSMSdr 
MCKSCENYGKFFFAMUQCIDNSSW^ 

YSmDDISVKIDGKQLEVRKWPTIXATAMVGVGWLSNEVALAWVHERVPDDYHPLP 
DLFFSHFPEVTCEREKffGILGKEEKEEKK^ 

QHRWIVIRRWFCIAMAYSFRALCVTIFQWWSE^TYCAPKSNSSLEL 
WSAGIEQLRPREIX^GDIJVSGHTLTIFTAFLWKTYAPQREQPLSFII^^ 
LARKHYMIDrVLGYTVSTPJFMEYHAIAASY^^ 
NMHNHLLLYNRSTSSKNVSTLKKSRRSFE 
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FIG. 10 
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FIG. 11 
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FIG. 12 
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